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We analyze the use of a driven nonlinear cavity to make a weak continuous measurement of 
a dispersively-coupled qubit. We calculate the backaction dephasing rate and measurement rate 
beyond leading-order perturbation theory using a phase-space approach which accounts for cavity 
noise squeezing. Surprisingly, we find that increasing the coupling strength beyond the regime de- 
scribable by leading-order perturbation theory (i.e. linear response) allows one to come significantly 
closer to the quantum limit on the measurement efficiency. We interpret this behaviour in terms of 
the non-Gaussian photon number fluctuations of the nonlinear cavity. Our results are relevant to 
recent experiments using superconducting microwave circuits to study quantum measurement. 



Introduction- There is considerable interest in exploit- 
ing continuous weak quantum measurements for the de- 
tection of fundamental quantum behavior as well as for 
quantum information processing [1-3]. By weak mea- 
surement, we mean the generic situation where the signal 
produced in the detector by the measured system is small 
compared to intrinsic output noise, and thus information 
is obtained only gradually in time. Such measurements 
are ultimately constrained by the Heisenberg uncertainty 
principle, which dictates that the backaction disturbance 
of the system by the detector cannot be arbitrarily small, 
but is instead bounded by the rate at which information 
is acquired [1-5]. Detectors capable of yielding an op- 
timally small ratio of backaction-to-information gain are 
known as quantum-limited. They are both of fundamen- 
tal interest, and are also necessary if one wishes to im- 
plement continuous quantum feedback algorithms [6-8] 
or certain quantum error correction schemes [9] . 

Not surprisingly, weak measurements are usually an- 
alyzed in the limit of a system-detector coupling small 
enough that leading-order perturbation theory in the 
coupling applies; in this standard regime, the quantum 
limit reduces to a constraint on the noise properties of 
the detector [3, 4]. Here, we focus on an alternate regime, 
where the detector-system coupling is still weak enough 
that information is obtained gradually in time, but not so 
weak that leading-order perturbation theory is sufficient. 
This regime of a "weak-but-not-too-weak" measurement 
has recently been achieved in experiments using a driven, 
nonlinear superconducting microwave cavity to measure 
the state of a superconducting qubit [10]. As with exper- 
iments using linear microwave cavities [11, 12], the qubit 
is dispersively coupled to the cavity, meaning that the 
cavity frequency depends on the qubit state; by moni- 
toring the phase of reflected microwaves from the cavity, 
one can monitor the qubit state. Introducing a nonlin- 
earity in the cavity via a Josephson junction (see Fig. 1) 
allows one to operate the cavity detector close to a point 
of bifurcation, where the state of the driven cavity is an 
extremely sensitive (but still single- valued) function of its 
frequency The enhanced sensitivity of this regime natu- 
rally leads to conditions where the measurement is weak, 



but the qubit-detector coupling cannot be treated pertur- 
batively. While information gain is enhanced here, the 
question remains whether this speedup comes at the cost 
of deviating from the quantum limit (i.e. excess backac- 
tion dephasing). 

In this Letter, we present an analytic theory describ- 
ing weak measurement of a qubit with a nonlinear cav- 
ity operated close to a point of bifurcation. Our non- 
perturbative approach accounts for the non-trivial cavity 
noise physics associated with the nonlincarity. We find 
that the information-gain to state-disturbance ratio is a 
strong function of the qubit-dctcctor coupling strength. 
In the limit of an extremely weak qubit-detector cou- 
pling, we recover previous perturbative results [13-16], 
which indicate a large deviation from the quantum limit: 
the backaction dephasing rate is a large factor G greater 
than the rate of information acquisition (the measure- 
ment rate), where G ^> 1 is the parametric photon- 
number gain associated with the driven nonlinear cavity. 
Increasing the coupling beyond the perturbative regime, 
we find remarkably that the dephasing rate is greatly sup- 
pressed compared to the leading-order prediction. This 
allows one to approach the quantum limit to within a 
factor of order unity. Our approach provides a general 
framework for investigating quantum measurement with 
driven nonlinear systems beyond weak coupling. 

Note that the backaction of a nonlinear cavity detec- 
tor was also considered by Boissonneault and co-workers 
[10, 15, 16]. They described backaction dephasing beyond 
lowest-order in the coupling by approximating the state 
of the driven cavity state conditioned on the qubit state 
to be a simple coherent state; this is only valid for op- 
erating points far from a bifurcation, where the detector 
has a relatively small gain. We show that close to a bi- 
furcation (where the cavity exhibits parametric gain and 
squeezing) , this approach does not accurately capture the 
coupling dependence of the backaction dephasing. 

Model- We consider a qubit coupled dispersively to 
a single-sided nonlinear cavity. While our approach ap- 
plies to an arbitrary nonlinearity, we focus here on the 
typical experimental situation [10, 17] where a Kerr-type 
nonlinearity dominates. Working in a frame rotating at 
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the cavity drive frequency Wd, the Hamiltonian is (U = 1) 

H sys = -Aa'a - Aatataa + H K + H qh + H int , (1) 

where A = Wd — w rav is the detuning between the cavity 
drive and resonance frequency, A is the Kerr constant, 
H K describes the cavity damping (with rate k) and driv- 
ing due to coupling to external modes, and H q b = Q<J Z 
is the qubit Hamiltonian. The dispersive QND qubit- 
cavity coupling Hamiltonian is H mt — Xa z a'a, where the 
coupling strength A sets the qubit-dependent cavity fre- 
quency shift. We will take the cavity to be at zero tem- 
perature, and ignore any intrinsic qubit dissipation, as 
we are interested only in the measurement backaction. 

As discussed, making a weak o~ z measurement of the 
qubit involves strongly driving the cavity while monitor- 
ing the reflected light from the cavity via a homodyne 
measurement; the two possible values of a z will lead to 
two different average homodyne currents, which as time 
progresses can be resolved above the intrinsic noise in 
these currents [3, 12]. We will focus exclusively on a 
weak nonlinearity A <C k and a strong drive amplitude, 
such that the stationary average cavity photon number 
(a^a) ^> 1 regardless of the initial qubit state. Apart 
from this constraint, we will not place any other restric- 
tions on how small the qubit-cavity coupling A must be. 

Dephasing rate- We first calculate the backaction de- 
phasing of the qubit that occurs during such a measure- 
ment; this dephasing is a direct consequence of the in- 
tracavity photon-number fluctuations. If the qubit is ini- 
tially in a a z eigenstate, it will remain in this state (due 
to the QND nature of the measurement); thus, from the 
cavity's perspective, the two qubit eigenstates simply cor- 
respond to a shift of the cavity resonance frequency by 
either ±A. In each case, the classically-expected cavity 
amplitude a a (a =f, I) will be given by 



i(A T A + 2A|a m | 



-i/o, (2) 



where /o is the amplitude of the cavity drive. Note that 
we focus on driving strengths small enough that we are 
below the bifurcation, i.e. there is only one classical so- 
lution a for a given A. By now writing a = d + a a and 
using the fact that \a a \ >• 1, we can approximate the 
cavity Hamiltonian corresponding to each qubit eigen- 
state by only keeping terms that are at most quadratic 
in d, d) . We thus obtain two linearized cavity Hamiltoni- 
ans, corresponding to the two qubit states. Each has the 
general form of a degenerate parametric amplifier (DPA) 
driven by an off-resonant pump [14, 18, 19]: 



H a = -A a Sd- 



l -(~gJ^S -~g*Jd), (3) 



where A-j-/^ = A =p A + 4|a.j./J 2 A is the effective pump 
detuning, and g a = — 2ia 2 A is the parametric strength. 
As one approaches a point of bifurcation in the cavity 



(e.g. by increasing the drive strength / ), the correspond- 
ing DPA Hamiltonian approaches the threshold of self- 
oscillation [14] (see inset of Fig. 2). For such operating 
points, incident waves on the cavity in the appropriate 
quadrature will be strongly amplified; this amplification 
is described by a photon number gain G which diverges 
as one approaches the bifurcation, as well as a narrow- 
bandwidth K s i ow ~ k/VG [14, 17, 19]. 

While the cavity evolution is easy to understand when 
the qubit is initially in a a z eigenstate, to calculate the 
dephasing rate we need to understand the cavity dynam- 
ics when the qubit is in a superposition of its eigenstates. 
We focus on the long-time qubit dephasing rate, which is 
defined as usual in terms of the decay of the qubit 's off- 
diagonal density matrix elements in the long-time limit: 
— | In TV (p| 4-)(t |) IA — >• r v , where p is the density matrix 
describing the full system. 

To proceed, we first introduce p^ = Tr q b,bath(p| 4-)(t 
|), where the trace is over the qubit and cavity bath de- 
grees of freedom. This is an operator acting in the cavity 
Hilbert space; its trace yields the off-diagonal element of 
the qubit density matrix, and hence can be used to ob- 
tain r^. We further transform p-j-j. by displacing away the 
two stationary classical cavity amplitudes a a associated 
with each qubit state. We thus obtain a operator p-j-j.: 



~p n (t) = D(-a t )p n (t)D\- ai ), 



(4) 



where D{a) — exp(aa^ — h.c.) is the cavity displacement 
operator. One can show that in the long-time limit, the 
exponential decay of Tr pj-^ (t) also yields the dephasing 
rate T v [20]. 

It is now straightforward to rigorously derive the evolu- 
tion equation of p-j-j., starting from the standard Linblad 
master equation describing the evolution of the cavity- 
plus-qubit density matrix [20] (see also [16]): 

d . . . .... 

-p n = nV[d]p n - i(H t p n - p n ffj - T vfi p n 



3t f 



(a t - a^puS - (a\ - a\)dp n . (5) 



Here T>[d]p^i = dp^^d) — (d^dp-fi + p^^d)d)/2 is the stan- 
dard Lindblad super-operator describing cavity damping. 
The second and third terms in Eq. (5) correspond to 
the Hamiltonian evolution of the cavity in our doubly- 
displaced frame, where we have used \a a \ >• 1 to lin- 
earize the two cavity Hamiltonians H a (c.f. Eq. (3)). 
The remaining terms on the RHS of Eq. (5) describe 
decoherence of the qubit resulting from the combina- 
tion of the cavity drive and cavity dissipation, with 
r v>0 - (n/2)\a^- ai \ 2 . 

For a linear cavity, the terms on the last line of Eq. (5) 
play no role, and the backaction dephasing rate is given 
completely by r Vj o (i.e. by the distinguishability of the 
two classical cavity amplitudes) [21]. For our case of a 
nonlinear cavity, the same is true if one neglects the para- 
metric amplification terms in H a (proptional to d 2 and 
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(Jt) 2 ), as then p n (t) = Cexp(-r pi0 t)|0)(0| (where |0) 
is the vacuum state and C a constant) trivially solves 
Eq. (5). This is equivalent to finding that (in the long 
time limit) the cavity state conditioned on the qubit is 
a coherent state \a a ). In this approximation, the back- 
action dephasing rate is given completely by the linear- 
cavity formula T v0 [10, 15, 16]. However, such an ap- 
proximation completely neglects the squeezing of noise 
by the nonlinear cavity. It is thus only valid for cavity 
parameters that are extremely far from any bifurcation, 
in regimes where the nonlinear cavity closely resembles a 
linear cavity. 

Given the importance of noise squeezing, we go beyond 
the above approximation by retaining all terms in Eq. (5). 
Defining v(t) = — lnTrp^(i), the long-time backaction 
dephasing rate will be given by — lim ^^Re v(t)/t. 
Setting 5a — a^ 



= r 



<p,0 



0|_, the trace of Eq. (5) yields 
+ k ( 5a ■ d) — h.c. 



n 



n 



,(6) 



where we have defined the quasi-expectation value 

(&)n = Tr (^Pn)/ Tr (Pn)- As f>n is not a true densit y 

matrix, the quasi-expectation of a Hermitian operator 
can be complex, and hence the second third terms above 
can contribute to the backaction dephasing. 

We now use the fact that Eq. (5) only involves terms 
that are at most quadratic in d,d\ and hence can be 
solved exactly by a which has a a Gaussian form 
(i.e. its phase space representation is Gaussian [20]). 
Eq. (5) thus reduces to a closed set of evolution equations 
for the quasi-means and covariances of d, cP (see [20] for 
details). Solving these and substituting into Eq. (6) di- 
rectly gives v and thus the dephasing rate. We stress 
that this approach is not perturbative in the coupling A, 
and it does not neglect the noise squeezing expected near 
a cavity bifurcation. A similar procedure can be used to 
calculate the backaction dephasing of a linear cavity sub- 
ject to both quantum and thermal noise [22]. 

To gain insight on the effect of increasing A, we first 
use the above approach to calculate T v to order A 4 . For 
a cavity detuning A and drive /o chosen to be close to 
the bifurcation point, one finds 



G 



G 3/2 
9n 



A 2 



32 
27 



G 



5 G 9 ' 2 
81 n 



(7) 

where n = |cto | 2 , &o is the zero-coupling classical cavity 
amplitude (i.e. solution to Eq. (2) at A = 0), and G 3> 1 
is the parametric photon- number gain (see Ref. [14]). 
At each order in A, we have retained the leading terms 
in n and G. The first term here (oc A 2 n) reproduces 
the results of Ref. [10, 14, 16] and arises solely from 
the linear-cavity dephasing rate T^.o m Eq. (6). The 
second term (also order A 2 ) is missed if one linearizes 
the qubit-cavity interaction, or makes the approximation 
Tip = r Vi o- More interesting are the leading A 4 terms. 
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FIG. 1: (a) Inset: schematic showing a Josephson-j unction 
circuit (i.e. nonlinear cavity) dispersively coupled to a qubit. 
Main: Measurement rate and dephasing rate versus qubit cou- 
pling strength A, in units of the cavity damping rate n, us- 
ing logarithmic axes. The blue (dot-dashed) curve is the full 
measurement rate r meas , while the light-blue (doted) curve is 
the linear-response approximation to r meas . The red (solid) 
line is the dephasing rate Y v as obtained from the full theory 
presented in the main text. The remaining lines are T v calcu- 
lated within less rigorous approximations: the grey (dotted) 
curve is the linear-cavity formula T^.o (c.f. Eq. (5)) and green 
(dashed) curve is leading-order perturbation theory. Param- 
eters are A = 1CT 3 k, f = 0.75/bif,n ~ 200, A = A M f where 
/bif and Abif are the driving force amplitude and detuning at 
the cavity bifurcation. The parametric photon number gain 
is G ~ 10 2 . One clearly sees that for moderate couplings, 
the dephasing rate is strongly suppressed compared to the 
perturbative result, (b) Same, but using linear axes. 



Surprisingly, this correction is negative, suggesting the 
possibility of a relative suppression of dephasing with in- 
creased coupling (relative to the lowest-order-in-A expres- 
sion). The leading A 4 corrections are completely due to 
the last line of Eq. (6), terms that would vanish if one 
ignored the squeezing of noise near the bifurcation. As 
such, the approximation T v ~ T^.o would predict both 
an incorrect sign and scaling with G of this term. 

To see the full consequence of this behaviour, we nu- 
merically solve Eq. (6) and the ODE's determining the 
needed averages; we use parameters corresponding to 
a weakly nonlinear cavity operated near a bifurcation 
point, similar to those realizable in experiment [10, 17], 
and well within the regime of validity of our theory. In 
Fig. 1, one sees clearly that the backaction dephasing 
rate as a function of coupling (red) drops markedly be- 
low both the expectations from lowest-order perturba- 
tion theory (green), and below the linear-cavity formula 
r^.o (grey). We have confirmed that this behaviour is 
generic whenever one is close to a bifurcation in the cav- 
ity: higher- order-in-X terms yield a marked suppression 
of the dephasing rate. 

For a heuristic understanding of the above behaviour, 
we return to the physical picture that dephasing of the 
qubit is due to the photon-number fluctuations of the 
driven cavity. Treating these fluctuations classically and 
defining m(t) = J Q dt 1 n(t') (where n is the cavity photon 
number), one finds that the off-diagonal qubit density 



4 



matrix is directly proportional to the characteristic func- 
tion of the probability distribution of to [23, 24]. As such, 
the long-time qubit dephasing rate can be expressed in 
terms of the even cumulants of m, ((to 2j )): 

r v = iim i£)(_iy-iW<< ro *». ( 8 ) 

This expansion also holds in the quantum case where to 
is an operator, if one now interprets the cumulants above 
using the standard Keldysh operator ordering [23, 24]. 

Eq. (8) implies that terms of order A 4 and higher in T v 
are directly due to the non-Gaussian nature of intracav- 
ity photon number fluctuations. In particular, having a 
negative contribution to T v at order A 4 requires a posi- 
tive kurtosis ((to 4 )). We note that even a driven linear 
cavity in the classical limit has non-Gaussian intra-cavity 
photon number fluctuations and a positive kurtosis; this 
is a simple consequences of n being the square of a Gaus- 
sian random variable (i.e. the cavity amplitude) [24]. A 
positive kurtosis indicates a distribution which is more 
peaked than a Gaussian, and hence noise that would 
generate less dephasing than truly Gaussian noise. In 
our nonlinear resonator, the non-Gaussian nature of the 
photon number fluctuations is strongly enhanced near bi- 
furcation by the intrinsic nonlinearity of the system. The 
kurtosis remains positive (like a linear resonator), but is 
much larger than would be expected for even a degener- 
ate parametric amplifier near threshold [20]. 

Measurement rate and quantum limit- For a measure- 
ment that occurs slowly on detector timescales, we can 
characterize the information gain of the measurement by 
a single measurement rate r meas : how quickly do the dis- 
tributions of the output homodyne current corresponding 
to each qubit eigenstate (t or \) become distinguishable. 
Generalizing the standard weak-coupling expression [1- 
3] to a situation where the coupling is not perturbative 
but the measurement is still slow compared to internal 
detector timescales, we find (see [20] for details) 

r = (/t - ii)2 m 

1 mcas , (Q , c V W 

Here I a is the average stationary homodyne current when 
the qubit is in the state a =~\, \,, and similarly, Su t(X is 
the zero-frequency spectral density of homodyne current 
fluctuations when the qubit is frozen in the state a. One 
can rigorously show that for arbitrary A, the measure- 
ment efficiency ratio x = Tmoas/r^ < 1 [20]. 

Using the linearized Hamiltonians H a given in Eq. (3) 
along with standard input-output theory [25] lets us eval- 
uate Eq. (9) for an arbitrary value of the coupling; com- 
paring against the dephasing rate then allows us to inves- 
tigate the behaviour of x as a function of coupling. One 
finds that similar to the linear-cavity dephasing rate r Vj o, 
the measurement rate is largely determined by the clas- 
sical amplitudes a a , and is hence a far weaker function 
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FIG. 2: Measurement efficiency ratio \ = Tmoas/r^ as a 
function of coupling strength for the same parameters as 
Fig. 1 (G — 10 2 , red-solid curve), and for an operating point 
closer to the bifurcation yielding G ~ 10 3 (blue dashed solid 
curve); the quantum limit is y = 1 (dashed-black line). Ver- 
tical lines indicate the maximum A for which the measure- 
ment time l/r meas is longer than the detector response time 
1/ftslow ~ VG/k. For A — > 0, one misses the quantum limit 
by a large amount: y ~ 1/G. However, a small increase in 
coupling greatly improves this efficiency ratio. Inset: Average 
cavity photon number versus drive detuning A (parameters 
as in Fig. 1). The black point indicates the chosen working 
point. The two qubit states lead to two different effective 
values of A; these are shown as white circles for A = 0.05k. 

of A than the dephasing rate. The result is that there is 
a range of A where higher-order terms significantly sup- 
press the dephasing rate (over the perturbative expres- 
sion), whereas the measurement rate is still determined 
by the leading-order expression (see Fig. la). 

Shown in Fig. 2 is x versus A for the same parameters 
as in Fig. 1. In the limit of a vanishing coupling strength, 
one deviates strongly from the quantum limit [14]. How- 
ever, the effective suppression of dephasing that occurs 
with a modest increase of coupling brings one within a 
factor of order unity of the ultimate quantum limit bound 
X = 1. We find that this behaviour is generic for cav- 
ity operating points near bifurcation: increasing the cou- 
pling A beyond the validity of leading-order perturbation 
theory allows one to make a weak measurement with a 
much higher efficiency than in the extreme weak coupling 
limit. On a physical level, this is a direct result of the 
non-Gaussian nature of photon number fluctuations in 
the driven cavity (namely, the large positive kurtosis). 

Conclusions- We have described a general method to 
calculate the measurement and dephasing rate of a qubit 
coupled to a nonlinear resonator that is not perturbative 
in the qubit-detector coupling and which accounts for 
cavity noise squeezing. By increasing the coupling to a 
regime where higher-order corrections are relevant, one 
can come significantly closer to the fundamental quantum 
limit on weak continuous qubit measurement. 

This work was supported by NSERC and CIFAR. 
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EQUATIONS OF MOTION FOR DOUBLY-DISPLACED CAVITY DENSITY MATRIX 

The standard Lindblad master equation describing the evolution of the qubit-plus-cavity density matrix p is: 

-^p = — i[— Ad^d — Ka)a)aa, p] + ifo[a + a\p] — i\[a^d ■ a z ,p] + nT>[a\p, (SI) 

where V{d]p = dpitf — {d) dp+ pd) d)/2 is the standard Lindblad super-operator describing cavity damping. Multiplying 
both sides of this equation by | \) (t | and then tracing over the qubit degrees of freedom yields the equation of motion 
for pf±. We are interested in the evolution of the displaced density operator p-^ which is obtained by displacing p-j-j. 
by different amounts on the left and right, as per Eq. (4). We thus displace Eq. (SI) in the same way; this directly 
yields the equation of motion for p^ given in Eq. (5). Note that while the underlying master equation in Eq. (SI) is 
in Linblad form, the final evolution equation for the double-displaced density matrix p^ is not in Linblad form (even 
though it followed directly from Eq. (SI)). This is of no great concern, as p-fi is not required to be a positive operator. 

To work with this displaced master equation we express p-^ in terms of its Wigner representation. Defining the 
cavity quadratures x = (d + d^)/\/2,p = —i(d — d))/^/2, the Wigner representation is defined by: 

i r°° 

W(x,p;t) = - dy{x + y\p n {t)\x-y)e- 2 ™, (S2) 

T J-oo 

where \x) are eigenkets of x. For simplicity, we consider the Wigner function in Fourier space using the convention 

W[k,q;t] = J dxdpe^ kx+q ^W{x,p-t). (S3) 

As discussed in the main text, the equation of motion Eq. (5) is solved exactly by taking p-|-j. to have a Gaussian 
form, i.e. the Wigner representation W[a:,p] has a Gaussian form. In Fourier space, we may thus write: 



W[k,q;t] = e~ v ^ 



exp 



i(x(t)k + p{t)q) - X -(k 2 o x (i) + q 2 cT p (t) + 2kqa xp {t)) 



(S4) 



Rewriting the evolution equation Eq. (5) as a differential equation for W(x,p) leads to a closed set of equations for 
the mean values x(t),p(t), the variances (T x (t),a p (t),a xp (t), and v(t) (which sets the trace of Pfj,). These means and 
covariances (as defined above) are equivalent to the definition in the main text involving a quasi-expectation value 
(see text after Eq. (6)). 

To present the equations of motion for the means and covariances which determine pj-^(i), it is useful to introduce 
parameters characterizing the average and difference of the two cavity Hamiltonians H^, corresponding to the two 
qubit eigenstates. For each parameter A in the cavity Hamiltonian of Eq. (3) (A = g, A, a), we thus define: 



5 A = A-f - A_i 



(S5) 



In the following equations, terms involving Sg and SA are a direct result of the different Hamiltonian evolution of the 
cavity associated with the two qubit states. In contrast, terms involving da arise from the non-Linblad dissipation 
terms in the last line of Eq. (5); these terms make no contribution in the case of a linear cavity. 

We first take the trace of Eq. (5), which corresponds to setting k = q = in the Fourier-transformed phase space 
equation. This then yields the equation of motion for the parameter v{t) as given in Eq. (6) in the main text. Writing 



2 



this explicitly, we have: 

v = T^o — i-^-(x 2 + p 2 + a x + o. p — 1) — i \[2 nlm [5a] x + i\/2KRe[5a]p 

T r sr ~ "1 

-iRe[Sg](xp + u xp ) + i — - — (p 2 - x 2 + a p - a x ). (S6) 

where r Vi0 = n\Sa\ 2 /2. 

Similarly, from Eq. (5) we find that the equations for the mean values take the form: 



x 



^— ^ + Re[g]^ x + (~A + I m [.9l) P + iSA(xa x + pcr xp ) + i-^=lm[Sa] (2a x — 1) — iV2tiRe[Sa]a x 



V2 

-iRe[Sg](xa xp + pcr x ) + i\m[5g]{xa x - pa xp ) 
p — — ^ + Re[g]^ p + ^A + Im[g]^ x + idA(pa p + xa xp ) — i-^=Re[8a] (2a p — 1) + iV2~Klra[Sa](T xp 

-ilm[6g] (pa p - xa xp ) - zRc [6g] (p<r xp + xa p ). (S7) 

In each of these equations, the first two terms are just the simple drift equations that one would obtain if the cavity 
was described by the average Hamiltonian (Hf + H±) /2. The remaining terms result from the difference between these 
two conditional Hamiltonians, as well as from the non-Linblad dissipation terms in Eq. (5). 

Finally, from Eq. (5), one finds that the equations of motion for the variances and covariances are 

& x = | + (2Rc[.g] — k 2iRe[5g]a xp )a x - (2A - 2Im[§])a xp + ilm[6g\{a 2 x - a 2 xp + 1/4) + iSA(a 2 x + a 2 xp - 1/4) 



K 

= 2 



a,, = - - (2Re[g] +k- 2iRe[5g]a xp )a p + (2A + 2lm[g])a xp - ihn[5g](al - a 2 xp + 1/4) + i5A(af } + a 2 xp - 1/4) 



& xp = A(a x - (T p ) + lm[g](a x + a p ) - na xp + iSA(a p + a x )a xp - iRe[Sg](a p a x + a 2 xv + 1/4) + ilva[8g](a x - cr p )cr xp . 

(S8) 

Note these equations have the general form of a matrix Riccati equation. 

OBTAINING THE DEPHASING RATE 

We focus on the long-time backaction dcphasing rate of the qubit, which is defined as usual by the decay of the 
qubit off-diagonal density matrix: 

im l I1 W MM _ 

V t^oo t 

As we now show, in the long-time limit, we can also obtain this dephasing rate by considering the decay of the 
doubly-displaced density matrix p-j-j. defined in Eq. (4): 



In 

T v = - lim 



Trp n 



lim Re M. (S10) 



t— >oo t t— >oo t 

To see this we consider the Wigner representation W(x,p) of p-j-j. (c.f. Eq. (S2)), as well as that of p^, 

1 f°° 

W(x,p) = - / (x + y\p n \x - y)e- 2im dy. (Sll) 



Eq. (4) of the main text tells us that the operators p^ and p\± are simply related by a pair of displacement 
transformations. This immediately implies that their Wigner transforms arc simply related by a displacement in 
phase space and an overall phase factor. Writing a a = (u a + iv a )/y/2, and defining 5u,Sv,u,v as per Eq. (S5), we 
have: 

W(x,p) = e i<Kx >riW(x + u,p + v), (S12) 
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where the (real) phase (f>{x,p) is given by 

<fi(x,p) = (5v(x + u) + Su(p + v)) + (vSu — uSv). 



(S13) 



It thus immediately follows that since W(x,p) has a Gaussian form, so will W(x,p). Using Eqs. (S12) and (S4), we 
thus find the Fourier transform of W (defined as in Eq. (S4)) is given by: 



W[k',q'] = e -"exp 
where the parameter v' {t) is given by: 
v' = v + 



i{{x + u)k' + (p + v)q') - -(k' 2 a x + q' 2 <j p + 2k'q'a xp ) 



i 11 

-(Su 2 (Tp + Sv 2 cr x + 2Sv ■ Sua xp ) — -uSv — xSv + -vSu — pSu 



(S14) 



(S15) 



Note that the trace of is just W[0, 0] = e From the definition in Eq. (S9), the dephasing rate is thus 
rigorously given by: 



lim 

t— s-oo t 



(S16) 



While Eq. (S15) implies that in general, v(t) ^ v'(t) (i.e. pf± and pf± do not have the same trace), it also tells us that 
v' (t) — v(t) tends to a constant in the long-time limit, as the covariances a x , a v and a xp all tend to a time-independent 
constant in this limit. We have thus proved Eq. (S10) which expresses the dephasing rate in terms of v (i.e. the trace 
of Pfi)- Thus, by solving Eqs. (S6),(S7) and (S8) we can directly calculate the backaction dephasing rate. 



DEPHASING RATE AND KURTOSIS OF INTRACAVITY PHOTON NUMBER FLUCTUATIONS 

As discussed in the main text, the strong suppression of backaction dephasing at order A 4 can be directly attributed 
to a strong, positive kurtosis (fourth cumulant) of the intracavity photon number fluctuations of the driven nonlinear 
cavity detector. In this section we show that this enhanced kurtosis cannot be obtained if one approximates the 
nonlinear cavity by a detuned degenerate parametric amplifier. Such an approximation is common, and is usually 
justified by first writing the cavity Hamiltonian in a frame where the classical cavity amplitude ao (given by Eq. (2) 
with A = 0) is displaced to the origin. Writing a — a + d, the full cavity Hamiltonian in this frame takes the form: 

H = -KSd + l - {g$$ - g*dd) - A(2a*Sdd + 2a G d) d) d) - A&Sdd. (S17) 

where A = A + 4A|ao| 2 is the effective drive detuning and g — 2iha 2 ) is the parametric interaction strength. The 
standard approach is now to assume a large cavity photon number n — |ct | 2 , and use this to drop terms that are 
cither cubic or quartic in the d, d) operators. The resulting truncated Hamiltonian, Hdpa, is that of a detuned, 
degenerate parametric amplifier (DPA) (i.e. identical to Eq. (3) of the main text with A = 0). We now show explicitly 
that this standard approach underestimates the magnitude the fourth cumulant of photon number fluctuations, even 
in the limit n» 1. 

The time-integral of the cavity photon number fluctuations is described by the operator 

dt' (a^t')a(t') -fi) = f dt' (a S{t') + a^d{t') + S(t')d(t')) . (S18) 
o Jo ^ ' 

Consider the fourth cumulant of m, calculated using the approximate DPA Hamiltonian. We would like to understand 
how this scales with photon-number gain G near the bifurcation; this can be done without explicitly Keldysh-ordcring 
operators. The leading-order-in-n contributions to the fourth cumulant will come from terms of the form 

^11 J* dt ^j ((V^(*i)) • (Vn~d(t 2 )) ■ (S(t 3 )d(t 3 )) ■ (S(t 4 )d(t 4 )) 
~ n(n f dtA (d t (t 1 )^ 3 ))-(rf(t 2 )d t (i 4 ))-(rf t (t 3 )rf(t4)) + ..., 



(S19) 



4 



where in the last line we have applied Wick's theorem, writing only one of the possible pairings explicitly. In the 
long-time limit, only the zero-frequency behaviour of the two-point correlations above will be important. One finds 
that near the bifurcation, these susceptibilities scale as [1]: 

J dt ($ '(t)d(O)^ ~ J dt (d{t)d) (0)J> ~ ^, (S20) 
where we have dropped terms lower order in G. This implies that for a true DPA near bifurcation, 

«™ 4 »DPA ~ ^f. (S21) 

This does not agree with the full calculation presented in the text, which finds that the kurtosis for the intracavity 
photon number fluctuations scales like G 4 near the bifurcation (c.f. Eq. (8) and (7) in the main text). The discrepancy 
is a direct result of neglect of the cubic and quartic terms in d in the full Hamiltonian of Eq. (S17). One can obtain 
the leading G 4 behaviour of the kurtosis (in agreement with the results of the main text) by treating these terms 
perturbatively. This explicitly demonstrates that the phase-space approach of the main text goes well beyond a simple 
linearization of the cavity Hamiltonian. 



MEASUREMENT RATE BEYOND LINEAR RESPONSE 



During a weak measurement information about the system is only acquired slowly, implying that the output 
homodyne current must be integrated over a finite period of time in order to resolve the qubit state. If the measurement 
begins at t = then the time-integrated homodyne intensity, s, is given by 



[ dt'I(t')~ f dt'(d out (t')e-^ + dl ut (t'y*), (S22) 
Jo Jo 



where d out = di n + \fkd is the cavity output field, as defined by input-output theory [4] . Each qubit cigenstate a =f, I 
will lead to a different probability distribution for s: we denote these two distributions p a (s). The measurement rate 
characterizes how quickly these two distributions become distinguishable (i.e. how quickly can we resolve the qubit 
state from the homodyne current). As is standard for weak continuous measurements [2], we define the measurement 
rate r mcas by the long-time decay of the overlap of Pf(s) and p±(s) 



In 



Tineas lini 



Jds ^p t {s)pi(s) 



t— >oo t 



(S23) 



Further, in the long-time limit (i.e. times longer than any internal detector timescale), the central limit theorem applies, 
and the distributions p CT (s) will be Gaussian. Each distribution will thus be fully characterized by the corresponding 
mean and variance of s. These are in turn related to the average homodyne current ( I}^ and zero-frequency homodyne 
current noise 

/oo 
dt{I(t)I(0)) a (S24) 
-oo 

associated with each qubit state. Using these Gaussian forms and directly integrating Eq. (S23), we obtain the 
result quoted in Eq. (9) of the main text. Note that in the limit of small A, Eq. (9) reduces to the standard linear- 
response expression for the measurement rate in the weak-coupling limit [2]. The required mean homodyne current 
and homodyne current noise in Eq. (9) can be calculated directly using the linearized Hamiltonians H a in Eq. (3) 
and input-output theory, following the approach of Ref. [1]. 

While the one can always define a measurement rate in the above manner, it is only a useful quantity when the 
measurement is slow compared to detector timescales, i.e. r mcas • idctcctor < 1- For our nonlinear cavity detector, the 
slowest internal detector timescale is associated with the narrow bandwidth of parametric amplification, and is given 
by isiow ~ VG/k [I]. 
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EFFICIENCY RATIO BEYOND LINEAR RESPONSE 



In the case where the coupling between the qubit and the cavity is weak enough that linear response is valid, there 
exists a fundamental quantum limit on the efficiency of quantum non-demolition (QND) qubit detection. This limit 
states that the best one can do is measure the qubit as quickly as one dephases it [2], i.e.: 

X=%^<1- (S25) 

While this constraint is most easily derived in the limit of a weak qubit-detector coupling [2], it can be generalized 
to the stronger couplings we are interested in. In the general QND case, one can derive a lower bound on the the 
dephasing rate T v by considering the most ideal situation, where the backaction dephasing is completely due to the 
process of information gain. The ideal case corresponds to the two following requirements: 

1. If at t = when the detector-qubit coupled is switched on the qubit is in its eigenstate \a) (er =t>4-)> then at 
time t the detector will be in the pure state |-D CT (t)). 

2. The states \D^(t)}, \Di(t)) are completely determined by the corresponding probability distributions p a (s) of 
the measured detector quantity (i.e. in our case, the integrated homodyne current). 

The first requirement implies that if the qubit is initially in a superposition of its eigenstates (i.e. a\ t) + b\ I)), 
then at at time t it would be entangled with the detector. The detector-qubit system would be described by the state 

|*(t)) = a\ t) ® \D t ) +b\l)® \Di.). (S26) 

Using the definition of Eq. (S9), the long-time dephasing rate would then be given by the overlap of the two detector 
pointer states: 

IV = -limlnK^™ (S27) 

* t^oo t 

The second requirement above further implies that if \s) represents a state with definite value of the detector output 
(i.e. integrated homodyne current), then the states \D a ) can be written: 

lAx) = J ds vOT)|«>- (S28) 

Using the definition of the measurement rate (Eq. (S23)) and the expression for the dephasing rate above (Eq. (S27)), 
we thus find in this most ideal case: 

l- 1 ^ Tineas; (S29) 

i.e. the measurement rate and dephasing rate coincide. This represents a lower bound on the dephasing. In the more 
general case, the overlap of the the detector pointer states will be less than that implied by Eq. (S28), as the qubit 
will also become entangled with degrees of freedom in the detector not directly related to the observed quantity s. 
Explicitly, in the more general case we would have: 

\D a ) = f^J ds ^Ms~)\s)^j ® \E a ). (S30) 

where the states \E a ) describe the additional detector degrees of freedom. If < 1, then there is "wasted 

information" information in the detector (i.e. information on the qubit state in the unobserved degrees of freedom 
described by \E a )), and the dephasing rate will be necessarily less than the ideal value r mcas . We have thus proved 
the quantum limit inequality Eq. (S25) without invoking a small detector-qubit coupling. 
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